
Internet Archive Web Crawls

The Internet Archive discovers and captures web pages through many different web crawls. At any given time, several distinct crawls are running; some run for months, while others recur daily or at longer intervals. View the web archive through the Wayback Machine.

1,180,490 results


Media Type: collections (112); web (1,178,846); data (1,532)
Year: 2018 (107,978); 2017 (204,615); 2016 (167,853); 2015 (139,882); 2014 (100,829); 2013 (138,510)
Topics & Subjects: crawldata (1,030,632); no404 (2,263); wikipedia (1,452); wordpress (811); amazonbooks (252); end of term (7)
Creator: internet archive (1,023,265); internetarchive (73,846); alexa internet (32,949); university of north texas (1,275); google, inc. (31); lekash@archive.org (3)
Language: English (3,915)
Live Web Proxy Crawls
Jul 30, 2017 Internet Archive
web
eye 809,212
favorite 0
comment 1
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live3.us.archive.org from 2013-08-14T05:36:46 UTC to 2013-08-14T16:31:43 UTC.
Rating: 2 of 5 stars (1 review)
Topic: crawldata
google ngrams
Dec 26, 2013 Google, Inc.
web
eye 63
favorite 0
comment 1
This item contains the Google ngram data for the American English language set. Here are the datasets backing the Google Books Ngram Viewer. These datasets were generated in July 2009; we will update these datasets as our book scanning continues, and the updated versions will have distinct and persistent version identifiers (20090715 for the current set). Each of the numbered links below will directly download a fragment of the given corpus. For instance, the first ten links...
Rating: 5 of 5 stars (1 review)
Live Web Proxy Crawls
Aug 6, 2013 Internet Archive
web
eye 712,126
favorite 0
comment 1
Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live4.us.archive.org from 2013-07-13T11:35:01 UTC to 2013-07-14T12:32:30 UTC.
Rating: 5 of 5 stars (1 review)
Topic: crawldata
Youtube Videos
web
eye 3
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl820.us.archive.org:youtube from Fri Feb 17 07:30:11 PST 2017 to Thu Feb 16 23:42:38 PST 2017.
Topic: crawldata
Youtube Videos
web
eye 16
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl819.us.archive.org:youtube from Thu Mar 30 13:45:29 PDT 2017 to Thu Mar 30 08:18:11 PDT 2017.
Topic: crawldata
Youtube Videos
web
eye 12
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl821.us.archive.org:youtube from Thu Aug 11 17:03:22 PDT 2016 to Thu Aug 11 10:46:58 PDT 2016.
Topic: crawldata
Youtube Videos
web
eye 5
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl821.us.archive.org:youtube from Fri Jul 29 11:58:20 PDT 2016 to Fri Jul 29 05:20:38 PDT 2016.
Topic: crawldata
crawl_UNK
web
eye 12
favorite 0
comment 0
This is an element of the Crawl UNK dataset from Alexa Internet
Youtube Videos
web
eye 9
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl440.us.archive.org:youtube from Fri Feb 26 21:09:53 PST 2016 to Fri Feb 26 13:25:32 PST 2016.
Topic: crawldata
crawl_UNK
web
eye 18
favorite 0
comment 0
This is an element of the Crawl UNK dataset from Alexa Internet
Youtube Videos
web
eye 4
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl822.us.archive.org:youtube from Sat Sep 3 22:37:24 PDT 2016 to Sat Sep 3 15:58:03 PDT 2016.
Topic: crawldata
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
Youtube Videos
web
eye 9
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl822.us.archive.org:youtube from Fri Jul 29 00:35:38 PDT 2016 to Thu Jul 28 18:04:27 PDT 2016.
Topic: crawldata
Youtube Videos
web
eye 8
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl819.us.archive.org:youtube from Mon Sep 7 00:29:02 PDT 2015 to Sun Sep 6 18:39:38 PDT 2015.
Topic: crawldata
Youtube Videos
web
eye 9
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl444.us.archive.org:youtube from Thu Feb 6 06:08:21 PST 2014 to Wed Feb 5 22:27:57 PST 2014.
Topic: crawldata
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
Youtube Videos
web
eye 11
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl822.us.archive.org:youtube from Fri Nov 25 06:55:30 PST 2016 to Thu Nov 24 23:48:59 PST 2016.
Topic: crawldata
Youtube Videos
web
eye 26
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl440.us.archive.org:youtube from Thu Mar 6 02:57:59 PST 2014 to Wed Mar 5 20:27:42 PST 2014.
Topic: crawldata
Youtube Videos
web
eye 6
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl440.us.archive.org:youtube from Sun Feb 23 03:19:23 PST 2014 to Sat Feb 22 20:18:59 PST 2014.
Topic: crawldata
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
Youtube Videos
web
eye 17
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl819.us.archive.org:youtube from Sun Apr 3 09:30:45 PDT 2016 to Sun Apr 3 03:12:12 PDT 2016.
Topic: crawldata
Youtube Videos
web
eye 16
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl459.us.archive.org:youtube from Tue Aug 13 06:20:42 PDT 2013 to Mon Aug 12 23:45:54 PDT 2013.
Topic: crawldata
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
Youtube Videos
web
eye 3
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl444.us.archive.org:youtube from Sat Mar 19 19:48:50 PDT 2016 to Sat Mar 19 13:52:12 PDT 2016.
Topic: crawldata
crawl_UNK
web
eye 28
favorite 0
comment 0
This is an element of the Crawl UNK dataset from Alexa Internet
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
Youtube Videos
web
eye 5
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl440.us.archive.org:youtube from Mon Jul 1 06:03:53 PDT 2013 to Mon Jul 1 00:01:23 PDT 2013.
Topic: crawldata
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
Youtube Videos
web
eye 8
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl444.us.archive.org:youtube from Tue Dec 9 03:15:21 PST 2014 to Mon Dec 8 19:38:59 PST 2014.
Topic: crawldata
Youtube Videos
web
eye 9
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl440.us.archive.org:youtube from Thu Dec 5 02:35:32 PST 2013 to Wed Dec 4 19:31:45 PST 2013.
Topic: crawldata
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
Youtube Videos
web
eye 5
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl821.us.archive.org:youtube from Fri Nov 3 23:38:20 PDT 2017 to Fri Nov 3 17:03:40 PDT 2017.
Topic: crawldata
crawl_UNK
web
eye 5
favorite 0
comment 0
This is an element of the Crawl UNK dataset from Alexa Internet
Youtube Videos
web
eye 7
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl819.us.archive.org:youtube from Sat Oct 1 23:28:51 PDT 2016 to Sat Oct 1 16:56:55 PDT 2016.
Topic: crawldata
Youtube Videos
web
eye 7
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl821.us.archive.org:youtube from Wed Jun 22 17:58:46 PDT 2016 to Wed Jun 22 11:13:44 PDT 2016.
Topic: crawldata
Youtube Videos
web
eye 6
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl820.us.archive.org:youtube from Fri Aug 11 18:52:24 PDT 2017 to Fri Aug 11 12:08:01 PDT 2017.
Topic: crawldata
Youtube Videos
web
eye 10
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl444.us.archive.org:youtube from Thu Jun 13 12:25:58 PDT 2013 to Thu Jun 13 06:02:52 PDT 2013.
Topic: crawldata
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
Youtube Videos
web
eye 8
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl820.us.archive.org:youtube from Thu Mar 31 10:46:18 PDT 2016 to Thu Mar 31 04:02:48 PDT 2016.
Topic: crawldata
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
Youtube Videos
web
eye 14
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl459.us.archive.org:youtube from Wed Oct 16 10:45:27 PDT 2013 to Wed Oct 16 04:03:34 PDT 2013.
Topic: crawldata
Youtube Videos
web
eye 4
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl442.us.archive.org:youtube from Wed Oct 25 09:27:45 PDT 2017 to Wed Oct 25 03:14:45 PDT 2017.
Topic: crawldata
Youtube Videos
web
eye 3
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl820.us.archive.org:youtube from Wed Mar 8 18:56:21 PST 2017 to Wed Mar 8 12:02:37 PST 2017.
Topic: crawldata
Youtube Videos
web
eye 3
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl823.us.archive.org:youtube from Sat Aug 12 19:46:00 PDT 2017 to Sat Aug 12 12:57:51 PDT 2017.
Topic: crawldata
Youtube Videos
web
eye 3
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl821.us.archive.org:youtube from Tue Mar 28 07:29:35 PDT 2017 to Tue Mar 28 01:05:41 PDT 2017.
Topic: crawldata
Youtube Videos
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl822.us.archive.org:youtube from Tue Sep 4 08:56:57 PDT 2018 to Tue Sep 4 02:32:39 PDT 2018.
Topic: crawldata
Youtube Videos
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl820.us.archive.org:youtube from Tue Aug 28 00:50:10 PDT 2018 to Mon Aug 27 18:10:46 PDT 2018.
Topic: crawldata
Youtube Videos
web
eye 3
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl821.us.archive.org:youtube from Mon Oct 30 15:39:38 PDT 2017 to Mon Oct 30 09:06:09 PDT 2017.
Topic: crawldata
Youtube Videos
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl810.us.archive.org:youtube from Thu Sep 28 10:19:24 PDT 2017 to Thu Sep 28 03:33:50 PDT 2017.
Topic: crawldata
Youtube Videos
web
eye 3
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl449.us.archive.org:youtube from Sun Oct 1 13:04:07 PDT 2017 to Sun Oct 1 06:53:20 PDT 2017.
Topic: crawldata
Youtube Videos
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl820.us.archive.org:youtube from Sat Sep 1 22:46:54 PDT 2018 to Sat Sep 1 16:19:41 PDT 2018.
Topic: crawldata
Youtube Videos
web
eye 93
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl821.us.archive.org:youtube from Mon Jun 12 16:13:43 PDT 2017 to Mon Jun 12 09:27:37 PDT 2017.
Topic: crawldata
Youtube Videos
web
eye 3
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl820.us.archive.org:youtube from Mon Aug 21 21:19:30 PDT 2017 to Mon Aug 21 14:34:39 PDT 2017.
Topic: crawldata
Youtube Videos
web
eye 13
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl444.us.archive.org:youtube from Sun May 24 23:29:40 PDT 2015 to Sun May 24 17:23:19 PDT 2015.
Topic: crawldata
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
Youtube Videos
web
eye 14
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl820.us.archive.org:youtube from Wed Jun 14 22:26:12 PDT 2017 to Wed Jun 14 15:45:10 PDT 2017.
Topic: crawldata
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
crawl_UNK
web
eye 10
favorite 0
comment 0
This is an element of the Crawl UNK dataset from Alexa Internet
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
Youtube Videos
web
eye 5
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl820.us.archive.org:youtube from Tue Nov 7 16:41:16 PST 2017 to Tue Nov 7 09:10:16 PST 2017.
Topic: crawldata
Youtube Videos
web
eye 4
favorite 0
comment 0
Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl410.us.archive.org:youtube from Sun Oct 6 05:28:20 PDT 2013 to Sat Oct 5 23:07:41 PDT 2013.
Topic: crawldata
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
crawl_UNK
web
eye 42
favorite 0
comment 0
This is an element of the Crawl UNK dataset from Alexa Internet
This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
crawl_UNK
web
eye 70
favorite 0
comment 0
This is an element of the Crawl UNK dataset from Alexa Internet
crawl_UNK
web
eye 29
favorite 0
comment 0
This is an element of the Crawl UNK dataset from Alexa Internet
crawl_UNK
web
eye 40
favorite 0
comment 0
This is an element of the Crawl UNK dataset from Alexa Internet
crawl_UNK
web
eye 14
favorite 0
comment 0
This is an element of the Crawl UNK dataset from Alexa Internet
crawl_UNK
web
eye 30
favorite 0
comment 0
This is an element of the Crawl UNK dataset from Alexa Internet